TO APPEAR IN IEEE TRANSACTIONS ON COMMUNICATIONS 






Soft-Decision-Driven Channel Estimation 
for Pipelined Turbo Receivers 

Daejung Yoon, Student Member, IEEE and Jaekyun Moon, Fellow, IEEE 






Abstract - We consider channel estimation specific to turbo equalization
for multiple-input multiple-output (MIMO) wireless communication.
We develop a soft-decision-driven sequential algorithm geared
to the pipelined turbo equalizer architecture operating on orthogonal 
frequency division multiplexing (OFDM) symbols. One interesting 
feature of the pipelined turbo equalizer is that multiple soft-decisions 
become available at various processing stages. A tricky issue is that 
these multiple decisions from different pipeline stages have varying 
levels of reliability. This paper establishes an effective strategy for 
the channel estimator to track the target channel, while dealing with 
observation sets with different qualities. The resulting algorithm is 
basically a linear sequential estimation algorithm and, as such, is 
Kalman-based in nature. The main difference here, however, is that 
the proposed algorithm employs puncturing on observation samples 
to effectively deal with the inherent correlation among the multiple 
demapper/decoder module outputs that cannot easily be removed 
by the traditional innovations approach. The proposed algorithm 
continuously monitors the quality of the feedback decisions and 
incorporates it in the channel estimation process. The proposed 
channel estimation scheme shows clear performance advantages 
relative to existing channel estimation techniques. 

Index Terms — channel estimation, MIMO-OFDM, turbo equal- 
ization, sequential estimator 



I. Introduction 

Combining the multiple-input multiple-output (MIMO) antenna
method with orthogonal frequency division multiplexing (OFDM)
and spatial multiplexing is a well-established wireless
communication technique. Bit-interleaved coded modulation
(BICM) [1] used in conjunction with MIMO-OFDM and spatial
multiplexing (SM) is particularly effective in exploiting both
spatial diversity and frequency selectivity without significant
design efforts on specialized codes [2], [3]. Turbo equalization
[?], also known as iterative detection and decoding (IDD) in
wireless applications [6], is well-suited for BICM-MIMO-OFDM
for high data rate transmission with impressive performance
potential [?], [?].

A critical issue in realizing the full performance potential
of a MIMO-OFDM system is significant performance degradation
due to imperfect channel state information (CSI). The
detrimental impact of imperfect CSI on MIMO detection is
well known (see, for example, [?], [8]) and continues to be
a great challenge in wireless communication system design.
Previous works have identified desirable training patterns
or pilot tones for estimating channel responses for MIMO
systems [9]-[13]. However, with these methods the achievable
data rate is inevitably reduced, especially when the number of
channel parameters to be estimated increases (e.g., caused by
an increased number of antennas).

Decision-directed (DD) channel estimation algorithms can
be applied to turbo receivers to improve channel estimation
accuracy [14]-[17]. However, inaccurate feedback decisions
degrade the estimator performance [18]. Maximum a posteriori
(MAP)-based DD algorithms discussed in [14], [15] can improve
the estimation accuracy, but they require additional
information such as the channel probability density function.
DD channel estimation working jointly with IDD has been
actively researched [19]-[23]. Among the existing works,
several papers have been devoted to iterative
expectation-maximization (EM) channel estimation algorithms
using extrinsic or a posteriori information fed back from the
outer decoder [19]-[21]. Although the traditional EM-based
estimation algorithms typically show outstanding performance,
the heavy computational complexity and the iteration latency
can be problematic for many practical applications. While an
approximation scheme as discussed in [20] can reduce
complexity, such approaches suffer from performance
degradation as the number of antennas increases [19]. Also,
the EM estimation algorithms need to be aided by pilot-based
EM algorithms to guarantee good performance [20], [21].

As an alternative approach to iterative EM channel estimation,
Kalman-based channel estimators that are effective against
error propagation have been discussed [22], [23]. The
authors of [22], [23] have introduced a soft-input channel
estimator that adaptively updates the channel estimates depending
on feedback decision quality. The soft-input channel estimator
of [22] evaluates the feedback decision quality by tracking the
noise variance that includes the potential soft-decision error
impact in its effort to improve the update process for the
Kalman filter.

In the present work, we develop a Kalman-based channel 
estimator for MIMO-OFDM based on a specific pipelined 
turbo equalizer receiver architecture. Before setting up the 
Kalman estimator, a novel method for reducing decision error 
correlation is introduced. The proposed method constructs a 
refined innovation sequence by irregularly puncturing certain 
soft decisions that are deemed to be correlated with the 
previous decisions. The resulting algorithm is basically a linear 
sequential estimation algorithm and, as such, is Kalman-based 
in nature. We also weigh the estimated channel responses in 
the detection process according to the quality of the estimation. 

A critical issue in turbo receiver design is long processing
latency due to inherent iterative processing of information.
Pipelined architecture reduces the latency and improves
processing throughput in turbo receivers and thus is the prevailing
choice of implementation architecture [24], [25]. One
interesting feature of the pipelined turbo equalizer is that multiple
sets of soft decisions become available at various processing
stages. A tricky issue is that these multiple decisions from
different pipeline stages have varying levels of reliability.
Therefore, an adequate optimization strategy is required for
the estimator to track the target channel while dealing with
observation sets with different qualities. An optimum channel
estimator is derived based on this principle for the pipelined
turbo receiver.

In demonstrating the viability of the proposed schemes, an
SM-MIMO-OFDM system is constructed to comply with the
IEEE 802.11n high-speed WLAN standard [26]. Section II
discusses the channel and system model, and briefly touches
upon the high-throughput pipelined IDD architecture. Section
III first discusses the method to set up an improved innovation
sequence via puncturing and then presents the proposed soft-DD
Kalman-based channel estimation methods.
Mean squared error (MSE) analysis is provided in Section IV
that validates the performance merits of the proposed schemes.
In Section V, the convergence behavior is investigated via
extrinsic information transfer (EXIT) charts [28], and
packet error rate (PER) simulation results are presented for
performance evaluation. Finally, conclusions are drawn in
Section VI.

II. Channel and System Model 

We assume an SM-MIMO-OFDM transmitter where a data
bit sequence is encoded by a convolutional channel encoder,
and the encoded bit stream is divided into $N_t$ spatial streams
by a serial-to-parallel demultiplexer. Each spatial stream is
interleaved separately, and the interleaved streams are
modulated using an $M$-ary quadrature amplitude modulation
($M$-QAM) symbol set $\mathcal{A}$ based on Gray mapping. Since
$Q$ binary bits form an $M$-QAM symbol, a binary vector
$\mathbf{b} = [b_0, b_1, \ldots, b_{QN_t-1}]^T$ is mapped to a transmitted symbol
vector $\mathbf{s} = [s_1, s_2, \ldots, s_{N_t}]^T$ (with $s_i \in \mathcal{A}$), taken from the set
$\mathcal{A}^{N_t}$, a Cartesian product of $M$-QAM constellations. The
$M$-QAM symbol sequence in each spatial stream is transmitted
by an OFDM transmitter utilizing a fixed number of frequency
subcarriers. For a particular subcarrier of the $n$-th OFDM
symbol, the received signal at the discrete Fourier transform
(DFT) output can be written as



$$\mathbf{z}_n = \mathbf{H}\mathbf{s}_n + \mathbf{n}_n, \tag{1}$$

where $\mathbf{z}_n = [z_1(n), z_2(n), \ldots, z_{N_r}(n)]^T$ is the received signal
vector observed at the $N_r$ receive antennas, $\mathbf{H}$ is the
channel response matrix associated with all wireless links
connecting the $N_t$ transmit antennas with the $N_r$ receive
antennas, and $\mathbf{n}_n$ is a vector of uncorrelated, zero-mean
additive white Gaussian noise (AWGN) samples of equal
variance $\mathcal{N}_0$.

Fig. 1: Block diagram of the turbo receiver and the soft-decision-directed channel estimator

Fig. 2: OFDM-symbol processing procedure in the pipelined IDD

The IDD technique of [?] that performs turbo equalization
for MIMO systems is assumed at the receiver. The extrinsic
information on the coded-bit stream is exchanged in the form
of log-likelihood ratios (LLRs) between the soft-input
soft-output (SISO) decoder and the SISO demapper as shown
in Fig. 1. The demapper takes advantage of the reliable
soft-symbol information made available by the outer SISO
decoder. A soft-output Viterbi algorithm (SOVA) is used for
the SISO decoder implementation [29]. Each data packet
transmitted typically contains many OFDM symbols, and they
are processed sequentially by the demapper and the decoder
as they arrive at the receiver. The feedback decisions used for
channel estimation must be interleaved coded-bit decisions.
The extrinsic information from the demapper is rearranged
accordingly and made available to the channel estimation
block. The pipelined architecture is adopted to reduce the
iteration latency [24], [25]. Fig. 2 illustrates OFDM symbols
processed in the pipelined IDD, and Fig. 3 shows the structure
of the pipelined IDD receiver and its interface with the
channel estimator. Multiple demapper-decoder pairs process
multiple OFDM symbols at different iteration stages. Let $N_{itr}$
denote the number of IDD iterations required to achieve
satisfactory error rate performance. The $N_{itr}$-stage pipelined
IDD receiver is equipped with $N_{itr}$ demappers and $N_{itr}$
decoders that are serially connected as in Fig. 3. The decoder
forwards its extrinsic information output to the demapper in
the next iteration stage. Simultaneously, the demapper and the
decoder in the previous iteration stage start to process a new
OFDM symbol. The pipelined IDD operation is functionally
equivalent to the original IDD scheme [24]. The extrinsic
LLRs released from the pipelined demappers and decoders
are utilized for the channel estimation. Let $N_{sym}$ denote
the total number of OFDM symbols in a packet and $N_f$
the number of feedback symbols available for channel
estimation. Note, however, that not all $N_f$ symbols are used
for the estimation. If the receiver requires $N_{itr}$ IDD iterations,
then a maximum of $2N_{itr}$ OFDM symbols are processed in
the pipelined IDD receiver as illustrated in Fig. 3. Because
the LLR outputs from the initial demapper and decoder have
low reliability, they are not used for the channel estimation.
Let index $n$ indicate the time. In this pipelined IDD setup,
when $2 < n < 2N_{itr}$, the channel estimator can get $(n - 2)$
feedback decisions (i.e., $N_f = n - 2$). When the number of
processed symbols increases to $2N_{itr}$ ($2N_{itr} < n < N_{sym}$),
$N_f$ is equal to $2N_{itr} - 2$. After all the OFDM symbols in
the packet have arrived at the receiver front-end, it takes
some time until all symbols clear out of the pipeline. For
$n > N_{sym}$, $N_f$ is equal to $N_{sym} + 2N_{itr} - n$.

Fig. 3: Block diagram of the proposed optimum channel estimation algorithm geared to the pipelined IDD
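The feedback-count schedule described above can be sketched as a small function. This is only an illustration: the name `feedback_count` is hypothetical, the code assumes a packet long enough to fill the pipeline, and it assumes that the boundary cases left open by the strict inequalities in the text ($n = 2N_{itr}$ and $n = N_{sym}$) belong to the full-pipeline regime.

```python
def feedback_count(n: int, n_itr: int, n_sym: int) -> int:
    """Number of feedback OFDM symbols N_f at time index n.

    Assumes n_sym >= 2 * n_itr (the packet is long enough to fill the pipeline).
    """
    if n <= 2:
        # Assumption: nothing usable yet, since the initial demapper and
        # decoder outputs are excluded from channel estimation.
        return 0
    if n < 2 * n_itr:
        return n - 2                      # pipeline still filling
    if n <= n_sym:
        return 2 * n_itr - 2              # pipeline full (boundary handling assumed)
    return max(n_sym + 2 * n_itr - n, 0)  # pipeline draining after the last symbol
```

For example, with `n_itr = 4` and `n_sym = 20`, the count grows as `n - 2` while the pipeline fills and then stays at `2 * 4 - 2 = 6` until the packet ends.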

III. Sequential and Soft-Decision-Directed 
Channel Estimation 

A. Derivation of the Kalman-Based Sequential Channel Esti- 
mation Algorithm 

The sequential form of the estimator is useful in improving
the quality of the channel estimate as the observed symbols
arrive in a sequential fashion, as OFDM symbols do in the
system of our interest. It is assumed that the channel is
quasi-static over $N_f$ OFDM symbol periods. For the pipelined IDD
receiver at hand, the observation equation is set up at the $r$-th
receiver (RX) antenna as

$$\mathbf{z}_n^{(r)} = \mathbf{S}_n\mathbf{h}^{(r)} + \mathbf{n}_n^{(r)}, \tag{2}$$

where $\mathbf{z}_n^{(r)}$ is the $N_f \times 1$ received signal vector, $\mathbf{S}_n$ is an
$N_f \times N_t$ matrix, and $\mathbf{h}^{(r)}$ is an $N_t \times 1$ multi-input
single-output (MISO) channel vector specific to the $r$-th RX
antenna. The goal is to do a sequential estimation of $\mathbf{h}^{(r)}$
as $n$ progresses. The estimation process is done in parallel
to obtain channel estimates for all $N_r$ RX antennas. With the
understanding that we focus on a specific RX antenna, the RX
antenna index $r$ is dropped to reduce notational clutter.

A mean symbol decision $\bar{s}$ is defined as the average of the
constellation symbols according to $\bar{s} = \sum_{s_i \in \mathcal{A}} s_i P(s_i)$, where
$P(s_i)$ is the "extrinsic probability" obtained from a direct
conversion of the available extrinsic LLR.
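As an illustration of this definition, the sketch below converts per-bit extrinsic LLRs into symbol probabilities and computes the mean symbol together with the decision error variance $\sum_{s_i \in \mathcal{A}} |s_i - \bar{s}|^2 P(s_i)$. The function names, the unit-energy Gray-mapped QPSK table, and the LLR sign convention $L = \ln[P(b=1)/P(b=0)]$ are assumptions made for the example, not details taken from the paper.

```python
import math
from itertools import product

def bit_prob_one(llr: float) -> float:
    """P(b = 1) for an LLR defined as ln P(b=1)/P(b=0)."""
    return 1.0 / (1.0 + math.exp(-llr))

def soft_symbol(llrs, constellation):
    """Return (mean symbol, decision error variance) for one symbol.

    llrs: extrinsic LLRs of the Q bits carried by the symbol.
    constellation: dict mapping Q-bit tuples to complex symbols.
    """
    p1 = [bit_prob_one(v) for v in llrs]
    probs = {}
    mean = 0j
    for bits in product((0, 1), repeat=len(llrs)):
        p = 1.0
        for b, q in zip(bits, p1):
            p *= q if b else (1.0 - q)   # bits are treated as independent
        probs[bits] = p
        mean += constellation[bits] * p
    var = sum(abs(constellation[b] - mean) ** 2 * p for b, p in probs.items())
    return mean, var

# Hypothetical Gray-mapped QPSK with unit average energy.
A = 1 / math.sqrt(2)
QPSK = {(0, 0): complex(A, A), (0, 1): complex(A, -A),
        (1, 0): complex(-A, A), (1, 1): complex(-A, -A)}
```

With all-zero LLRs the mean symbol collapses to the origin and the variance equals the symbol energy, while confident LLRs drive the mean toward a constellation point and the variance toward zero.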



1) Innovation Sequence Setup: The pipeline architecture
can be viewed as a buffer large enough to accommodate $N_f$
OFDM symbols, but in our channel estimator design we take into
account the different levels of reliability of the soft
decisions coming out of the demapper or decoder modules at
different iteration stages. First, defining the soft decision error
$\mathbf{E} = \mathbf{S} - \bar{\mathbf{S}}$, (2) can be rewritten as

$$\mathbf{z}_n = \{\bar{\mathbf{S}}_n + \mathbf{E}_n\}\mathbf{h} + \mathbf{n}_n. \tag{3}$$

Note that this type of soft decision representation has been
used previously [22]. We emphasize, however, that unlike in
[22], our derivation of a linear sequential estimator is based
on the attempt to explicitly generate the innovation sequence.
As will be clear in the sequel, this approach has led us to
the realization that the standard steps taken to generate the
innovations do not work in our setup; this in turn allowed us
to devise corrective measures. Let us first see if we can find
$\mathbf{x}_n$, the innovations of $\mathbf{z}_n$ (i.e., the whitened sequence that is
a causal, as well as a causally invertible, linear transformation
of $\mathbf{z}_n$). We write:

$$\mathbf{x}_n \triangleq \mathbf{z}_n - \bar{\mathbf{S}}_n\hat{\mathbf{h}}_{n-1} \tag{4}$$

$$\phantom{\mathbf{x}_n} = \bar{\mathbf{S}}_n(\mathbf{h} - \hat{\mathbf{h}}_{n-1}) + \mathbf{E}_n\mathbf{h} + \mathbf{n}_n. \tag{5}$$


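Ideally, the vector sequence $\mathbf{x}_n$ would represent an innovation
sequence in the sense that any given component of the vector
$\mathbf{x}_{n-k}$ is orthogonal to any component of $\mathbf{x}_n$ as long as $k \neq 0$.
In this scenario we would have

$$E\big[x_{n-k}[i]\,x_n^*[j]\big] = E\big[\mathbf{e}_{n-k}[i]\,\hat{\mathbf{h}}_{n-k-1}\hat{\mathbf{h}}_{n-1}^H\,\mathbf{e}_n^H[j]\big] + E\big[n_{n-k}[i]\,n_n^*[j]\big] = \begin{cases}\sigma_s^2\sum_{t=1}^{N_t}\rho_{n-1}^{(t)} + \mathcal{N}_0, & \text{when } k = 0 \text{ and } i = j\\ 0, & \text{otherwise,}\end{cases} \tag{6}$$

where $\mathbf{e}_n[i]$ indicates the $i$-th row vector of $\mathbf{E}_n$,
$\rho_{n-1}^{(t)} \triangleq E[\hat{h}_{n-1}^{(t)}\hat{h}_{n-1}^{(t)*}]$, and $\sigma_s^2 = E[|s - \bar{s}|^2]$ is the symbol decision error
variance. The superscript 'H' and the symbol '*' denote the
Hermitian transpose and the complex conjugate, respectively.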
In deriving (6), we assumed $E[x_{n-k}[i](\mathbf{h} - \hat{\mathbf{h}}_{n-1})^H] = \mathbf{0}$ and
$E[\bar{s}[i]\,e^*[j]] = 0$ for any $k$, $i$ and $j$. In order for this to be
true, though, the following must hold:

1) Links in the MISO channel are uncorrelated.

2) The channel estimate and decision error are independent.

3) The decision errors are uncorrelated
(i.e., $E[e_{n-k}[i]\,e_n^*[j]] = 0$ for $k \neq 0$ or $i \neq j$).

Under these three assumptions, the vector $\mathbf{x}_n$ reasonably
represents an innovation sequence.

Fig. 4: Correlations in the 'innovation' sequence: (a) $E[\mathbf{x}_{n-2}\mathbf{x}_n^H]$, (b) $E[\mathbf{y}_{n-2}\mathbf{y}_n^H]$ with $c = 0.8$ (normalized by $E[|y_n[0]|^2]$, averaged over 50 erroneous packets)

2) Innovation Sequence with Punctured Feedback: Assumption
(1) is reasonable if the RX antennas maintain
reasonable physical separation. However, assumptions (2) and
(3) are not convincing. A poor channel estimate generates
a poor decision, which in turn degrades the subsequent
channel estimation. This makes both assumptions (2)
and (3) invalid. As such, the Kalman filter is no longer optimum,
and the correlated error circulates in the IDD and
channel estimator loop. Our goal here is to provide a refined
innovation sequence to reduce this error propagation. First,
we observe that there is no significant correlation between the
decision errors of the demapper and decoder thanks to the
interleaver/deinterleaver. An issue is the demapper-demapper
or decoder-decoder output correlation for a given received
signal (OFDM symbol), especially when a packet is bad (i.e.,
certain tones cause errors despite persistent IDD efforts). In the
pipelined IDD setup, it takes two time steps for a demapper
decision to shift to the next-stage demapper, and likewise for
the decoder outputs. Consequently, components of observation
vectors separated by an even time difference are correlated, as seen in
Fig. 4(a) between $\mathbf{x}_n$ and $\mathbf{x}_{n-2}$. In addition, we cannot assume
that the noise is random as long as identical observations are
reused during the iterative channel estimation.

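This correlation in $\mathbf{x}_n$ is definitely problematic for any
Kalman estimator design. Imagine removing correlation in $\mathbf{x}_n$
using the Gram-Schmidt procedure:

$$\tilde{x}_n[f] = x_n[f] - \frac{\langle x_n[f],\, x_{n-2}[f-2]\rangle}{|x_{n-2}[f-2]|^2}\,x_{n-2}[f-2], \tag{7}$$

where $\langle a, b\rangle$ denotes the inner product $\langle a, b\rangle = \mathrm{Re}(a)\mathrm{Re}(b) + \mathrm{Im}(a)\mathrm{Im}(b)$. Now, (6) can be rewritten (for
$k = 2$ and dropping indices to simplify notation) as

$$E[x_{n-2}x_n^*] = E\left[x_{n-2}\left(\tilde{x}_n + \frac{\langle x_n, x_{n-2}\rangle}{|x_{n-2}|^2}\,x_{n-2}\right)^*\right] = E\left[|x_{n-2}|^2\,\frac{\langle x_n, x_{n-2}\rangle}{|x_{n-2}|^2}\right] = E[\langle x_n, x_{n-2}\rangle], \tag{8}$$

which suggests that by using only those samples of $x_n$ for which
$\langle x_n, x_{n-2}\rangle < \epsilon$, where $\epsilon$ is an adjustable threshold, we
can limit the amount of correlation in the overall observation
samples utilized.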



Before delving into the proposed "puncturing" process,
we note that the amount of puncturing needs to be decided
judiciously, as removing observation samples also tends to
"harden" the decisions, making the overall system approach
one of hard decision feedback, a situation we need to avoid.
Also, one may be tempted to use the conceptually more
straightforward approach of subtracting out the correlated
component as suggested by (7), or its generalized version
that also subtracts less correlated components, but we
had no meaningful success in reducing correlated errors with
approaches along this direction.

Equation (8) suggests the following as a measure of correlation
between the previous demapper and current demapper
outputs (or between the previous decoder and current decoder
outputs):

$$\rho_n(f) = \langle x_{n-2}[f-2],\, x_n[f]\rangle. \tag{9}$$

Now define $N_d$ as the number of components among the $x_n(f)$'s
satisfying the threshold condition

$$|\rho_n(f)| < c\,\mathcal{N}_0. \tag{10}$$

With this condition, let index $d$ now enumerate the
selected components among the $N_f$ feedback symbols (i.e., $d = 0, \ldots, N_d - 1$).
The constant $c$ ($> 0$) is an important parameter
that controls the puncturing threshold. An improved innovation
sequence $\mathbf{y}_n$ can be written as

$$\mathbf{y}_n = \mathbf{G}_n\mathbf{x}_n, \tag{11}$$

where $\mathbf{G}_n$ is defined as an $N_d \times N_f$ puncturing matrix. For
the $d$-th row vector $\mathbf{g}_n^{(d)}$ of the puncturing matrix, the elements are
given as

$$g_n^{(d)}[f] = \begin{cases}1, & \text{if } |\rho_n(f)| < c\,\mathcal{N}_0 \text{ or } d = f = 0 \text{ or } d = f = 1\\ 0, & \text{otherwise.}\end{cases} \tag{12}$$

Note that $x_n[0]$ and $x_n[1]$ are new input elements from the
first demapper and decoder outputs, which are automatically
included in the refined innovation vector. As long as the
observations are reused during the iterative process, the noise
correlation is also problematic in the channel estimation. To
resolve this issue, a scaled noise variance is adopted as the
threshold criterion to judge minimum correlation, because the
noise variance term in (6) is inevitable. Highly correlated
signal and noise components are punctured out depending on
the constant $c$.
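The selection rule in (9), (10) and (12) can be illustrated with a short sketch that returns the indices of the surviving components. The function names are hypothetical, and treating a component that has no counterpart two time steps earlier as "kept" is an assumption of this example rather than something the text specifies.

```python
def inner(a: complex, b: complex) -> float:
    """Real inner product <a, b> = Re(a)Re(b) + Im(a)Im(b)."""
    return a.real * b.real + a.imag * b.imag

def puncture(x_cur, x_prev2, c, n0):
    """Indices f of x_n that survive puncturing.

    x_cur:   components x_n[f] at the current time step.
    x_prev2: components x_{n-2}[f] from two time steps earlier.
    c, n0:   threshold constant and noise variance; the threshold is c * n0.
    """
    keep = []
    for f, x in enumerate(x_cur):
        if f < 2:
            keep.append(f)            # fresh first-stage outputs are always kept
        elif f - 2 >= len(x_prev2):
            keep.append(f)            # assumption: no earlier counterpart, so keep
        elif abs(inner(x_prev2[f - 2], x)) < c * n0:
            keep.append(f)            # weakly correlated with x_{n-2}[f-2]
    return keep

def apply_puncturing(x_cur, keep):
    """Form the punctured vector y_n from the kept indices."""
    return [x_cur[f] for f in keep]
```

A larger `c` keeps more components, which mirrors the trade-off between the number of observations and the amount of residual correlation.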

It is insightful to consider a simple argument based on random
puncturing. Suppose the observation samples are dropped
in a random fashion. Then, the element $x_{n-2}[f-2]$ can of
course be excluded from $\mathbf{y}_{n-2}$ by puncturing, and so can $x_n[f]$
from $\mathbf{y}_n$. With random puncturing, the innovation process on
each element can be analyzed as (dropping index $f$)

$$\begin{aligned}E[y_{n-2}\,y_n^*] &= E[g_{n-2}\,x_{n-2}\,x_n^*\,g_n]\\ &= P(g_{n-2} = 1)\,E[x_{n-2}x_n^*]\,P(g_n = 1)\\ &= \left(\frac{N_d^{(n-2)}}{N_f^{(n-2)}}\right)\left(\frac{N_d^{(n)}}{N_f^{(n)}}\right)E[x_{n-2}x_n^*]\\ &= \frac{N_d^{(n-2)}N_d^{(n)}}{N_f^{(n-2)}N_f^{(n)}}\,E[x_{n-2}x_n^*],\end{aligned} \tag{13}$$

where $P(g_n = 1)$ is the probability that the corresponding
component $x_n$ exists in $\mathbf{y}_n$. As $N_f$ increases and/or $N_d$
decreases in (13), the correlation $E[y_{n-2}\,y_n^*]$ decreases (likewise
for the variance-normalized correlation). The same is true for
the noise correlation.
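The scaling factor in (13) can be checked numerically. The sketch below, with hypothetical function names, compares the closed-form product of retention probabilities with a Monte Carlo estimate obtained by drawing independent uniformly random puncturing patterns at times $n-2$ and $n$; the independence of the two patterns is the simplifying assumption behind (13).

```python
import random

def retained_fraction(nd_prev, nf_prev, nd_cur, nf_cur):
    """Scaling factor in (13): P(g_{n-2} = 1) * P(g_n = 1)."""
    return (nd_prev / nf_prev) * (nd_cur / nf_cur)

def simulate_retention(nd_prev, nf_prev, nd_cur, nf_cur, trials=20000, seed=1):
    """Monte Carlo estimate of the probability that one tracked component
    survives independent random puncturing at both time steps."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        # Track component 0; it survives if it is among the kept indices.
        kept_prev = 0 in rng.sample(range(nf_prev), nd_prev)
        kept_cur = 0 in rng.sample(range(nf_cur), nd_cur)
        hits += kept_prev and kept_cur
    return hits / trials
```

Keeping 3 of 6 components at time $n-2$ and 4 of 6 at time $n$, for instance, scales the correlation by one third.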

Fig. 4 shows an example of correlation in the innovation
sequence before and after the refinement through puncturing:
$E[\mathbf{x}_{n-2}\mathbf{x}_n^H]$ vs. $E[\mathbf{y}_{n-2}\mathbf{y}_n^H]$. The sequence $\mathbf{y}_n$ may have a
smaller number of observation samples, but its correlation
is low as seen in Fig. 4(b), which helps maintain the
optimality of the Kalman filter. The parameter $c$ controls a
trade-off: if $c$ is large, the number of observation samples increases,
which can be beneficial for ML estimation. However, a large $c$
can feed biased decision errors to the Kalman-based estimator.

Note that the actual puncturing process is not fully random,
contrary to the assumption made in (13). However, the puncturing
happens irregularly, and an interesting observation we make is
that irregular puncturing activity becomes more pronounced in
broken (bad) packets. Once the decisions are incorrect,
correlation between the components of $\mathbf{x}_n$ appears, and puncturing
becomes active. In order to salvage a bad packet from biased
errors, the puncturing attempts to "innovate" the sequence $\mathbf{x}_n$.
Moreover, at high SNR, random puncturing is not necessary to
produce reliable decisions, because the signal term itself in (5)
(without the noise and estimation error terms) is an innovation
sequence. The puncturing process in this context can
also be viewed as an effort to prevent redundant information
from circulating in the iterative signal processing. We observe
that although the puncturing cannot completely remove the
correlated errors, a significant portion of the biased errors gets
eliminated before the channel estimation step resumes.



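B. Kalman-Based Sequential Channel Estimation Algorithm
with Punctured Innovation Sequence

Once the punctured innovation sequence $\mathbf{y}_n$ is generated, a
linear channel estimator can be specified as a matrix $\mathbf{A}$, that
is, $\hat{\mathbf{h}} = \mathbf{A}\mathbf{y}_n$. The Kalman estimator is now derived as

$$\begin{aligned}\hat{\mathbf{h}}_n &= \hat{E}[\mathbf{h}\,|\,\mathbf{y}_1, \mathbf{y}_2, \ldots, \mathbf{y}_n]\\ &= \hat{E}[\mathbf{h}\,|\,\mathbf{y}_1, \mathbf{y}_2, \ldots, \mathbf{y}_{n-1}] + \hat{E}[\mathbf{h}\,|\,\mathbf{y}_n]\\ &= \hat{\mathbf{h}}_{n-1} + \mathbf{A}_n\mathbf{y}_n,\end{aligned} \tag{14}$$

where $\hat{E}[a\,|\,b]$ denotes the optimal linear estimator of $a$ given
$b$. To find the linear estimator matrix $\mathbf{A}_n$, the orthogonality
principle is applied:

$$\begin{aligned}\overline{(\mathbf{h} - \mathbf{A}_n\mathbf{y}_n)\mathbf{y}_n^H} &= \mathbf{0}\\ \mathbf{A}_n\overline{\mathbf{y}_n\mathbf{y}_n^H} &= \overline{\mathbf{h}\mathbf{y}_n^H},\end{aligned} \tag{15}$$

where an overbar also indicates statistical expectation. The
right-hand side of the last line in (15) is given by

$$\overline{\mathbf{h}\mathbf{y}_n^H} = \overline{(\mathbf{h} - \hat{\mathbf{h}}_{n-1})(\mathbf{h} - \hat{\mathbf{h}}_{n-1})^H}\,\bar{\mathbf{S}}_n^H = \mathbf{P}_{n-1}\bar{\mathbf{S}}_n^H, \tag{16}$$

where $\mathbf{P}_{n-1}$ is defined as the channel estimation error variance
matrix, and the term $\overline{\mathbf{y}_n\mathbf{y}_n^H}$ in (15) can be written as

$$\overline{\mathbf{y}_n\mathbf{y}_n^H} = \bar{\mathbf{S}}_n\overline{(\mathbf{h} - \hat{\mathbf{h}}_{n-1})(\mathbf{h} - \hat{\mathbf{h}}_{n-1})^H}\,\bar{\mathbf{S}}_n^H + \overline{\mathbf{E}_n\mathbf{h}\mathbf{h}^H\mathbf{E}_n^H} + \mathcal{N}_0\mathbf{I}_{N_d}. \tag{17}$$

Now using (15), (16) and (17), the matrix $\mathbf{A}_n$ is obtained as

$$\begin{aligned}\mathbf{A}_n &= \overline{\mathbf{h}\mathbf{y}_n^H}\,\big(\overline{\mathbf{y}_n\mathbf{y}_n^H}\big)^{-1}\\ &= \mathbf{P}_{n-1}\bar{\mathbf{S}}_n^H\big(\bar{\mathbf{S}}_n\mathbf{P}_{n-1}\bar{\mathbf{S}}_n^H + \mathbf{Q}_n + \mathcal{N}_0\mathbf{I}_{N_d}\big)^{-1}\\ &= \big(\bar{\mathbf{S}}_n^H(\mathbf{Q}_n + \mathcal{N}_0\mathbf{I}_{N_d})^{-1}\bar{\mathbf{S}}_n + \mathbf{P}_{n-1}^{-1}\big)^{-1}\bar{\mathbf{S}}_n^H(\mathbf{Q}_n + \mathcal{N}_0\mathbf{I}_{N_d})^{-1}.\end{aligned} \tag{18}$$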

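The next steps to complete the derivation process are to
express $\mathbf{P}_{n-1}$ and $\mathbf{Q}_n$ in a recursive fashion. Noticing
$(\mathbf{h} - \hat{\mathbf{h}}_n) = \mathbf{h} - (\hat{\mathbf{h}}_{n-1} + \mathbf{A}_n\mathbf{y}_n)$ from (14), the channel estimation
error variance at time $n$ can be rewritten as

$$\begin{aligned}\mathbf{P}_n &= \overline{\{\mathbf{h} - (\hat{\mathbf{h}}_{n-1} + \mathbf{A}_n\mathbf{y}_n)\}\{\mathbf{h} - (\hat{\mathbf{h}}_{n-1} + \mathbf{A}_n\mathbf{y}_n)\}^H}\\ &= \overline{(\mathbf{h} - \hat{\mathbf{h}}_{n-1})(\mathbf{h} - \hat{\mathbf{h}}_{n-1})^H} - \overline{(\mathbf{h} - \hat{\mathbf{h}}_{n-1})\mathbf{y}_n^H}\,\mathbf{A}_n^H - \mathbf{A}_n\overline{\mathbf{y}_n(\mathbf{h} - \hat{\mathbf{h}}_{n-1})^H} + \mathbf{A}_n\overline{\mathbf{y}_n\mathbf{y}_n^H}\,\mathbf{A}_n^H\\ &= \mathbf{P}_{n-1} - \mathbf{A}_n\bar{\mathbf{S}}_n\mathbf{P}_{n-1}\\ &= (\mathbf{I}_{N_t} - \mathbf{A}_n\bar{\mathbf{S}}_n)\mathbf{P}_{n-1},\end{aligned} \tag{19}$$

where we utilized the relation $\mathbf{A}_n\overline{\mathbf{y}_n\mathbf{y}_n^H} = \mathbf{P}_{n-1}\bar{\mathbf{S}}_n^H$, which
follows from (15) and (16). Also note that $\mathbf{P}_n$ is a symmetric
matrix whose pivots have non-negative real values.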

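Finally, $\mathbf{Q}_n$ needs to be found. The symbol decision error
variance $\sigma_s^2 = E[|s - \bar{s}|^2]$ can be found by using the extrinsic
probabilities (i.e., $\sigma_s^2 = \sum_{s_i \in \mathcal{A}} |s_i - \bar{s}|^2 P(s_i)$). Under the
reasonable assumption of $\overline{(s_j - \bar{s}_j)(s_i - \bar{s}_i)^*} = 0$ when $i \neq j$,
the $N_d \times N_d$ diagonal matrix $\mathbf{Q}_n$ is given as

$$\mathbf{Q}_n = \overline{\mathbf{E}_n\mathbf{h}\mathbf{h}^H\mathbf{E}_n^H} = \mathrm{diag}\left[\sum_{t=1}^{N_t}\rho^{(t)}\sigma_s^2(n,0,t),\ \ldots,\ \sum_{t=1}^{N_t}\rho^{(t)}\sigma_s^2(n,N_d-1,t)\right]_{N_d \times N_d}, \tag{20}$$

where $\rho^{(t)} = |h^{(t)}|^2$. However, finding $\rho^{(t)}$ is a bit tricky
as the channel state information is unknown to the receiver.

The channel correlation matrix $\overline{\mathbf{h}\mathbf{h}^H}$, on the other hand, can
be found from $\overline{\mathbf{h}\mathbf{h}^H} = \overline{\{(\mathbf{h} - \hat{\mathbf{h}}_n) + \hat{\mathbf{h}}_n\}\{(\mathbf{h} - \hat{\mathbf{h}}_n) + \hat{\mathbf{h}}_n\}^H}$,
which reduces to $\mathbf{P}_n + \hat{\mathbf{h}}_n\hat{\mathbf{h}}_n^H$. Utilizing this expression, we
can write

$$\mathbf{Q}_n = \overline{\mathbf{E}_n\mathbf{h}\mathbf{h}^H\mathbf{E}_n^H} = \mathrm{diag}\left[\sum_{t=1}^{N_t}\big(p_n(t,t) + |\hat{h}_t[n-1]|^2\big)\sigma_s^2(n,0,t),\ \ldots,\ \sum_{t=1}^{N_t}\big(p_n(t,t) + |\hat{h}_t[n-1]|^2\big)\sigma_s^2(n,N_d-1,t)\right], \tag{21}$$

where $\hat{h}_t[n-1]$ is from the previous estimate $\hat{\mathbf{h}}_{n-1}$, $p_n(t,t)$
is the $t$-th diagonal element of $\mathbf{P}_{n-1}$, and $\sigma_s^2(n,j,t)$ is the
decision error variance of the $(j,t)$ element of $\bar{\mathbf{S}}_n$.

Putting it all together, for the receive antenna $r$, the proposed
estimator is summarized as a set of equations:

$$\mathbf{Q}_n^{(r)} = \mathrm{diag}\left[\sum_{t=1}^{N_t}\big(p_n^{(r)}(t,t) + |\hat{h}_t^{(r)}[n-1]|^2\big)\sigma_s^2(n,0,t),\ \ldots,\ \sum_{t=1}^{N_t}\big(p_n^{(r)}(t,t) + |\hat{h}_t^{(r)}[n-1]|^2\big)\sigma_s^2(n,N_d-1,t)\right]_{N_d \times N_d} \tag{22}$$

$$\mathbf{A}_n^{(r)} = \big(\bar{\mathbf{S}}_n^H(\mathbf{Q}_n^{(r)} + \mathcal{N}_0\mathbf{I}_{N_d})^{-1}\bar{\mathbf{S}}_n + \mathbf{P}_{n-1}^{(r)\,-1}\big)^{-1}\bar{\mathbf{S}}_n^H(\mathbf{Q}_n^{(r)} + \mathcal{N}_0\mathbf{I}_{N_d})^{-1} \tag{23}$$

$$\mathbf{P}_n^{(r)} = (\mathbf{I}_{N_t} - \mathbf{A}_n^{(r)}\bar{\mathbf{S}}_n)\mathbf{P}_{n-1}^{(r)} \tag{24}$$

$$\hat{\mathbf{h}}_n^{(r)} = \hat{\mathbf{h}}_{n-1}^{(r)} + \mathbf{A}_n^{(r)}\mathbf{y}_n^{(r)}, \tag{25}$$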

where $\hat{\mathbf{h}}_{-1}$, corresponding to the initial time $n = 0$, can be
given by an initial channel estimator based on the use of known
preambles. Also, the initial matrix $\mathbf{P}_{-1}$ can be derived from the
MMSE analysis [?] as $\mathbf{P}_{-1}^{(r)} = \mathrm{diag}\big[|h^{(r,t)}|^2/(\gamma|h^{(r,t)}|^2 + 1)\big]$
for $t = 1, \ldots, N_t$, where $\gamma = E_s/(N_t\mathcal{N}_0)$. We note that the
channel estimation algorithm summarized in (22)-(25) takes
into account the quality of the soft decisions that are generated
at various stages in the pipeline for a given processing time $n$.
When $t = 1$, the resulting algorithm becomes similar to the one
presented in [22] for the inter-symbol interference channel,
as the gist of the algorithm of [22] is in incorporating the
quality of the soft decisions as part of the effective noise in the
Kalman sequential updating process. The difference, however,
is that in our algorithm, we do not assume that the operation
$\mathbf{z}_n - \bar{\mathbf{S}}_n\hat{\mathbf{h}}_{n-1}$ makes the observation sequence automatically
white, which, as argued above, would be faulty. Also, in our
algorithm, the varying qualities of the decisions generated from
different processing modules at a given time are taken into
account in the update process. More specifically, the effective
noise covariance matrix of (22) is a function not only of $n$ but
also of $N_d$, which itself is a growing function of $n$ initially (up
to $2N_{itr}$).
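One recursion of the update for a single RX antenna can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name and argument layout are hypothetical, and the rows of the punctured observation are processed one at a time. Because the effective noise matrix $\mathbf{Q}_n + \mathcal{N}_0\mathbf{I}$ is diagonal, this row-by-row form is the standard sequential equivalent of the batch gain and avoids an explicit matrix inverse.

```python
def kalman_update(h_prev, P_prev, S_bar, z, sigma2, n0):
    """One soft-decision-directed update for one RX antenna.

    h_prev: previous channel estimate (list of Nt complex values).
    P_prev: previous Nt x Nt error variance matrix (list of lists).
    S_bar:  Nd x Nt soft decisions that survived puncturing.
    z:      Nd received samples matching the rows of S_bar.
    sigma2: Nd x Nt decision error variances sigma_s^2(n, j, t).
    n0:     noise variance.
    """
    nt = len(h_prev)
    # Effective noise per observation: diagonal of Q_n plus N0, built from
    # the previous estimate and the diagonal of the previous P.
    r = [sum((P_prev[t][t].real + abs(h_prev[t]) ** 2) * sigma2[j][t]
             for t in range(nt)) + n0
         for j in range(len(z))]
    h = list(h_prev)
    P = [row[:] for row in P_prev]
    for j, s in enumerate(S_bar):
        # Gain for this row: k = P s^H / (s P s^H + r_j)
        Ps = [sum(P[a][b] * s[b].conjugate() for b in range(nt)) for a in range(nt)]
        denom = sum(s[a] * Ps[a] for a in range(nt)).real + r[j]
        k = [v / denom for v in Ps]
        # Residual against the running estimate, then the estimate update.
        resid = z[j] - sum(s[a] * h[a] for a in range(nt))
        h = [h[a] + k[a] * resid for a in range(nt)]
        # Error variance update: P <- (I - k s) P
        sP = [sum(s[a] * P[a][b] for a in range(nt)) for b in range(nt)]
        P = [[P[a][b] - k[a] * sP[b] for b in range(nt)] for a in range(nt)]
    return h, P
```

Rows with a large decision error variance receive a large effective noise term and therefore a small gain, which is how unreliable pipeline stages are de-emphasized in the update.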

C. Noise Variance Update for the Soft Detectors 

A Kalman-based estimation algorithm, as the one proposed 
here, has the advantage (compared with, e.g., EM-like algo- 
rithms) that the channel estimation error variance is available 
for free and it is continually updated as a part of the recursive 
process. Realizing that the channel estimation error variance 
is a reasonable measure of how accurate the channel estimate 
is, this information somehow should play a beneficial role in 
the detection (or demapping) process. 

As the first step in utilizing the available channel estimation
error variance, the observation equation of (1) is recast with
the channel estimation error shown explicitly:

$$\mathbf{z}_n^{(k)} = \hat{\mathbf{H}}_n\mathbf{s}_n^{(k)} + \underbrace{(\mathbf{H} - \hat{\mathbf{H}}_n)\mathbf{s}_n^{(k)}}_{\triangleq\,\mathbf{a}_n^{(k)}} + \mathbf{n}_n^{(k)}, \tag{26}$$

where superscript $k$ points to a specific demapper out of
the $N_{itr}$ demappers operating in the pipeline stages ($k = 1, \ldots, N_{itr}$).
Accordingly, $\mathbf{s}_n^{(k)}$ here corresponds to each odd
row of $\mathbf{S}_n$ in (2). For the $k$-th demapper in the pipeline, the
noise variance is updated to include the channel estimation
error:

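$$\mathbf{N}_0^{(k)} = E\Big[\big\|\mathbf{s}_n^{(k)}(\mathbf{H} - \hat{\mathbf{H}}_n) + \mathbf{n}\big\|^2\Big] = \mathrm{Cov}(\mathbf{a}^{(k)}, \mathbf{a}^{(k)}) + \mathcal{N}_0\mathbf{I}, \tag{27}$$

where $\|\cdot\|$ indicates the vector norm operation. The $N_r \times N_r$
covariance matrix $\mathrm{Cov}(\mathbf{a}^{(k)}, \mathbf{a}^{(k)})$ can be obtained (with the
understanding that we are focusing on the $k$-th demapper in the
pipeline at time $n$, we drop the indices $k$ and $n$ to simplify
notation) as

$$\mathrm{Cov}(\mathbf{a}, \mathbf{a}) = E\big[(\mathbf{H} - \hat{\mathbf{H}})^H\mathbf{s}^H\mathbf{s}(\mathbf{H} - \hat{\mathbf{H}})\big]$$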

~ diag 



Nt N t 

t=i t=i 



(28) 



where the approximation is due to the assumption that channel
estimation errors and transmitted symbols are independent and
that $E[\mathbf{s}_n^H\mathbf{s}_n] \approx E[\bar{\mathbf{s}}_n^H\bar{\mathbf{s}}_n]$. Note that the updated noise variance
is specified in matrix form because different RX antennas are
subject to different channel estimation errors in the Kalman
estimator. This is the same as saying that each RX antenna is
subject to a different amount of observation noise. Therefore,
the demapper algorithm must be properly optimized for the
given equivalent noise covariance matrix.

The demappers in the pipeline utilize (26). An $M$-QAM
symbol vector transmitted from $N_t$ TX streams is demapped
to one binary vector $\mathbf{b} = [b_0, b_1, \ldots, b_{QN_t-1}]^T$. Using the
updated noise variance, the likelihood function of the MIMO
demapper is given as

$$p(\mathbf{z}\,|\,\mathbf{s}) = \prod_{r=1}^{N_r}\frac{1}{\pi\mathcal{N}_0^{(r)}}\exp\left(-\frac{|z^{(r)} - \hat{\mathbf{h}}^{(r)}\mathbf{s}|^2}{\mathcal{N}_0^{(r)}}\right), \tag{29}$$



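where $\mathcal{N}_0^{(r)}$ is the noise variance corresponding to $z^{(r)}$, that is,
the $(r, r)$ diagonal element of matrix $\mathbf{N}_0$. The $k$-th soft MAP
demapper in the pipeline directly gives out the a posteriori LLR
output $L_P$:

$$L_P(b_i) = \ln\frac{P(b_i = 1\,|\,\mathbf{z})}{P(b_i = 0\,|\,\mathbf{z})} = \ln\frac{\sum_{\mathbf{s} \in \mathcal{A}^{N_t}|b_i = 1} p(\mathbf{z}\,|\,\mathbf{s})\prod_{j \neq i}P(b_j)}{\sum_{\mathbf{s} \in \mathcal{A}^{N_t}|b_i = 0} p(\mathbf{z}\,|\,\mathbf{s})\prod_{j \neq i}P(b_j)} + \ln\frac{P(b_i = 1)}{P(b_i = 0)}, \tag{30}$$

where $i = 0, \ldots, QN_t - 1$ for the individual bits in the
transmitted symbol vector.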
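A brute-force version of this demapper can be sketched for a small system. The example assumes a Gaussian likelihood with one effective noise variance per RX antenna and evaluates the a posteriori LLR by enumerating all candidate bit vectors in the log domain. All function names are hypothetical, and the one-bit-per-stream BPSK mapper is chosen only to keep the example short.

```python
import math
from itertools import product

def log_likelihood(z, H_hat, s, n0_per_rx):
    """ln p(z|s) with a separate noise variance for each RX antenna."""
    total = 0.0
    for r, (z_r, h_r) in enumerate(zip(z, H_hat)):
        resid = z_r - sum(h * x for h, x in zip(h_r, s))
        total += -abs(resid) ** 2 / n0_per_rx[r] - math.log(math.pi * n0_per_rx[r])
    return total

def logsumexp(vals):
    """Numerically stable ln(sum(exp(v)))."""
    m = max(vals)
    return m + math.log(sum(math.exp(v - m) for v in vals))

def map_llrs(z, H_hat, n0_per_rx, prior_llrs, mapper):
    """A posteriori LLRs for every bit; mapper turns a bit tuple into a symbol vector."""
    n_bits = len(prior_llrs)
    metrics = {}
    for bits in product((0, 1), repeat=n_bits):
        # ln prod_j P(b_j), using P(b=1) = 1/(1+exp(-L)) and P(b=0) = 1/(1+exp(L)).
        log_prior = sum(-math.log1p(math.exp(-L if b else L))
                        for b, L in zip(bits, prior_llrs))
        metrics[bits] = log_likelihood(z, H_hat, mapper(bits), n0_per_rx) + log_prior
    llrs = []
    for i in range(n_bits):
        num = logsumexp([m for b, m in metrics.items() if b[i] == 1])
        den = logsumexp([m for b, m in metrics.items() if b[i] == 0])
        llrs.append(num - den)  # includes the a priori term of bit i
    return llrs

def bpsk_mapper(bits):
    """Hypothetical mapping: bit 1 -> +1, bit 0 -> -1, one bit per TX stream."""
    return [1.0 if b else -1.0 for b in bits]
```

Because each antenna term is divided by its own variance, an antenna with a poorer channel estimate contributes less to the decision metric. The enumeration grows exponentially with the number of bits, so this form is only practical for small constellations and few streams.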

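The MMSE demapper solution can also be derived from the
modified observation equation (26). The MMSE demapper can
be shown to yield

$$\hat{\mathbf{s}} = E[\mathbf{s}] + \boldsymbol{\Sigma}_s\hat{\mathbf{H}}^H\big(\hat{\mathbf{H}}\boldsymbol{\Sigma}_s\hat{\mathbf{H}}^H + \mathrm{Cov}(\mathbf{a}, \mathbf{a}) + \mathcal{N}_0\mathbf{I}\big)^{-1}\big(\mathbf{z} - \hat{\mathbf{H}}E[\mathbf{s}]\big), \tag{31}$$

where $E[\mathbf{s}]$ is a mean-symbol vector based on the a priori
probabilities, and $\boldsymbol{\Sigma}_s$ is given as $\mathrm{diag}[\sigma_1^2, \ldots, \sigma_{N_t}^2]$.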

IV. Mean Squared Error (MSE) Analysis 

In the MSE analysis, we try to understand 1) the impact
of biased soft decision errors, and 2) when the soft decision
error is unbiased, the performance impact of mismatching the
soft decision error variance in the channel estimation process.
Through the MSE analysis, we also investigate the impact
of the number and quality of decisions used in the estimation
process.

The soft decisions fed back from the detectors and decoders
are assumed to have potential errors and are written as

$$\bar{s}[d, n] = s[d, n] + e[d, n], \tag{32}$$



where d = 0, .., N d —1. As discussed in Section lTlI-Bl the fed- 
back soft-decisions may contain biased decision errors. So the 
decision error e d is modeled as 
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Fig. 5: Threshold parameter c optimization 



e[d, n] 



■q[d,n], 



(33) 



Meanwhile, when m/0 with the same decision error power 



where m is a non-zero-mean random variable, and q is a white o g , the MSE with the biased decision error is calculated as 
Gaussian noise with zero mean and variance a] 2 . Also, denote 



a s = E[\e\ ]. For both biased and unbiased cases, assume that 
the total decision error power a 2 is identical. Also assume 
correlations of the bias mean with the symbol as well as with 
the channel are zero (i.e. E[sm] = and E[hm] = 0). 

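The proposed estimator is designed based on the linear
MMSE (LMMSE) criterion. For the MISO communication
channel model introduced earlier, the LMMSE estimator is expressed as

$$\hat{\mathbf{h}}_n^{(r)} = \mathbf{R}_h^{(r)} \tilde{\mathbf{S}}_n^H \left( \tilde{\mathbf{S}}_n \mathbf{R}_h^{(r)} \tilde{\mathbf{S}}_n^H + \mathbf{V}_n^{(r)} \right)^{-1} \mathbf{z}_n^{(r)} \tag{34}$$

$$\phantom{\hat{\mathbf{h}}_n^{(r)}} = \left( \tilde{\mathbf{S}}_n^H \tilde{\mathbf{S}}_n + \nu_n^{(r)} \mathbf{R}_h^{(r)-1} \right)^{-1} \tilde{\mathbf{S}}_n^H \mathbf{z}_n^{(r)} \tag{35}$$

where $\mathbf{V}_n^{(r)} = \nu_n^{(r)} \mathbf{I}_{N_d}$ with $\nu_n^{(r)} = \sigma_e^2 \sum_{t=1}^{N_t} \rho_t^{(r)} + \mathcal{N}_0$,
and $\mathbf{R}_h^{(r)} = E\{\mathbf{h}^{(r)} \mathbf{h}^{(r)H}\}$. Also, denote $\boldsymbol{\Psi} = \tilde{\mathbf{S}}_n^H \tilde{\mathbf{S}}_n + \nu_n^{(r)} \mathbf{R}_h^{(r)-1}$.
The estimator of (35) is optimum under unbiased
decision errors ($m = 0$), and the minimum estimation error
variance of the MIMO LMMSE estimator is obtained as

$$\varepsilon^2_{\mathrm{unbiased}}[n] = \sum_{r=1}^{N_r} E\left\{ \left\| \mathbf{h}^{(r)} - \hat{\mathbf{h}}_n^{(r)} \right\|^2 \right\} = \sum_{r=1}^{N_r} \mathrm{tr}\left\{ \left( \tilde{\mathbf{S}}_n^H \mathbf{V}_n^{(r)-1} \tilde{\mathbf{S}}_n + \mathbf{R}_h^{(r)-1} \right)^{-1} \right\} \tag{36}$$

where $\varepsilon^2_{\mathrm{unbiased}}$ is the estimation error variance of the optimum
MMSE estimator (35). As $N_d$ increases, it is reasonable
to write $\tilde{\mathbf{S}}^H \tilde{\mathbf{S}} \approx N_d (E_s + \sigma_e^2) \mathbf{I}_{N_t}$. Accordingly,
we have

$$\varepsilon^2_{\mathrm{opt}} = \varepsilon^2_{\mathrm{unbiased}} = \sum_{r=1}^{N_r} \sum_{t=1}^{N_t} \frac{\rho_t^{(r)} \nu^{(r)}}{\rho_t^{(r)} N_d (E_s + \sigma_e^2) + \nu^{(r)}} \tag{37}$$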
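The large-$N_d$ expression can be evaluated directly to see how the MSE responds to the number of decisions and to the decision error variance. The following sketch assumes the per-antenna form $\rho_t \nu / (\rho_t N_d (E_s + \sigma_e^2) + \nu)$ with $\nu = \sigma_e^2 \sum_t \rho_t + \mathcal{N}_0$, as read from the definitions above. The function names and the example tap powers are illustrative choices, not values from the paper.

```python
def unbiased_lmmse_mse(tap_powers, n_d, e_s, sigma_e2, noise_var):
    """Large-N_d MSE of the matched LMMSE estimator for one receive antenna.

    tap_powers holds the per-TX-antenna channel powers rho_t (an assumed input).
    """
    nu = sigma_e2 * sum(tap_powers) + noise_var   # effective noise variance
    return sum(
        rho * nu / (rho * n_d * (e_s + sigma_e2) + nu) for rho in tap_powers
    )


def total_mse(per_rx_tap_powers, n_d, e_s, sigma_e2, noise_var):
    """Sum the per-antenna MSE over all receive antennas."""
    return sum(
        unbiased_lmmse_mse(taps, n_d, e_s, sigma_e2, noise_var)
        for taps in per_rx_tap_powers
    )
```

For example, calling `unbiased_lmmse_mse([0.5, 0.5], n_d, 1.0, 0.1, 0.04)` with `n_d` equal to 6 and then 24 shows the MSE shrinking as more decisions are used, while raising `sigma_e2` at a fixed `n_d` makes it grow.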



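Meanwhile, when $m \neq 0$ with the same decision error power
$\sigma_e^2$, the MSE with the biased decision error is calculated as

$$\varepsilon^2_{\mathrm{biased}} = \sum_{r=1}^{N_r} E\left\{ \left\| \mathbf{h}^{(r)} - \hat{\mathbf{h}}^{(r)}_{\mathrm{biased}} \right\|^2 \right\} \tag{38}$$

where $\hat{\mathbf{h}}_{\mathrm{biased}} = \left( \tilde{\mathbf{S}}_{\mathrm{biased}}^H \tilde{\mathbf{S}}_{\mathrm{biased}} + \nu \mathbf{R}_h^{-1} \right)^{-1} \tilde{\mathbf{S}}_{\mathrm{biased}}^H \mathbf{z}$
and $\tilde{\mathbf{S}}_{\mathrm{biased}}$ is the matrix of soft decisions
with correlated error. Note that the correlation matrix
of decision errors is $E[\mathbf{E}^H \mathbf{E}] = N_d \sigma_e^2 \mathbf{I}_{N_t} + N_d \boldsymbol{\Psi}_{N_t}$,
where $\boldsymbol{\Psi}_{N_t}$ is a matrix with all diagonal elements set to
zero and all off-diagonal elements to $|m|^2$. Assuming
a very large $N_d$ and applying the matrix inversion lemma
$(\mathbf{X} + \mathbf{Y})^{-1} = \mathbf{X}^{-1} - \mathbf{X}^{-1} (\mathbf{X}^{-1} + \mathbf{Y}^{-1})^{-1} \mathbf{X}^{-1}$, we can write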



(pUased^biased + ^raR/i 



(N d (E s + a 2 ) I 



Nt 



(39) 



=A 



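Using the expansion above and the facts $\mathrm{tr}(\boldsymbol{\Psi}^{-1} \boldsymbol{\Psi}_{N_t} \mathbf{R}_h) = 0$ and
$\mathrm{tr}(\boldsymbol{\Psi}_{N_t} \mathbf{R}_h) = 0$, the MSE of the LMMSE estimator suffering
from correlated decision errors is finally expressed as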



Nr 

EM 



R 



0) 



:(>■) 



unbiase< 



tr{(N d (E s+ a 2 )A^)} 

[(N d (E s +a 2 ] 



N r 



'unbiased 



r—l 



(40) 



Note that $\mathbf{A}\mathbf{R}_h$ is a positive semidefinite matrix, and therefore
$\varepsilon^2_{\mathrm{biased}} \geq \varepsilon^2_{\mathrm{unbiased}}$ when $m \neq 0$. This confirms the loss due
to correlated decision errors, even if the error power is the
same. In effectively whitening the correlated decision error, the
constant c is a crucial parameter that determines the number of
selected symbols $N_d$ and thus controls the trade-off between
the observation sample size and the amount of error correlation
in the channel estimator. The existence of an optimum value
for c is also shown through the MSE simulation results of
Fig. 5. Based on Fig. 5, we set c = 2 for the 3 × 3 and
4 × 4 SM-MIMO-OFDM systems, and c = 2.5 for the 2 × 2
SM-MIMO-OFDM system.

Fig. 6: Open-loop channel estimation MSE for different values of $\sigma_e^2$ ($N_d = 12$)

Fig. 7: Open-loop channel estimation MSE depending on different values of $N_d$ ($\sigma_e^2 = 0.1$)

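Even with unbiased decision errors, the LMMSE estimator
suffers performance degradation when the noise variance is
mismatched. Let us quantify the MSE penalty associated with
not accounting for the uncertainty inherent in the soft decisions
in the form of increased noise variance. The LMMSE estimator
failing to consider the soft decision error can be described as

$$\mathbf{w}_n^{(r)} = \mathbf{R}_h^{(r)} \tilde{\mathbf{S}}_n^H \left( \tilde{\mathbf{S}}_n \mathbf{R}_h^{(r)} \tilde{\mathbf{S}}_n^H + \mathcal{N}_0 \mathbf{I}_{N_d} \right)^{-1} \mathbf{z}_n^{(r)} \tag{41}$$

$$\phantom{\mathbf{w}_n^{(r)}} = \left( \tilde{\mathbf{S}}_n^H \tilde{\mathbf{S}}_n + \mathcal{N}_0 \mathbf{R}_h^{(r)-1} \right)^{-1} \tilde{\mathbf{S}}_n^H \mathbf{z}_n^{(r)}. \tag{42}$$

Utilizing (37) and denoting $\boldsymbol{\Delta}_n^{(r)} = \hat{\mathbf{h}}_n^{(r)} - \mathbf{w}_n^{(r)}$, the estimation
error variance of the estimator (42) can be shown to be

$$\begin{aligned}
\varepsilon_w^2[n] &= \sum_{r=1}^{N_r} E\left\{ \left\| \mathbf{h}^{(r)} - \mathbf{w}_n^{(r)} \right\|^2 \right\} \\
&= \sum_{r=1}^{N_r} E\left\{ \left\| \mathbf{h}^{(r)} - \left( \hat{\mathbf{h}}_n^{(r)} - \boldsymbol{\Delta}_n^{(r)} \right) \right\|^2 \right\} \\
&= \varepsilon^2_{\mathrm{opt}}[n] + \sum_{r=1}^{N_r} \mathrm{tr}\left\{ E\left\{ \hat{\mathbf{h}}_n^{(r)} \hat{\mathbf{h}}_n^{(r)H} - \hat{\mathbf{h}}_n^{(r)} \mathbf{w}_n^{(r)H} - \mathbf{w}_n^{(r)} \hat{\mathbf{h}}_n^{(r)H} + \mathbf{w}_n^{(r)} \mathbf{w}_n^{(r)H} \right\} \right\}
\end{aligned} \tag{43}$$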
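The step from the first to the last line of (43) is worth spelling out. The following is a short derivation sketch, assuming (as the text does) that $\hat{\mathbf{h}}$ is the LMMSE estimate, so that its error is uncorrelated with any linear function of the observation; indices $r$ and $n$ are omitted, and $\boldsymbol{\Delta} = \hat{\mathbf{h}} - \mathbf{w}$.

```latex
\begin{aligned}
E\{\|\mathbf{h} - \mathbf{w}\|^2\}
  &= E\{\|(\mathbf{h} - \hat{\mathbf{h}}) + \boldsymbol{\Delta}\|^2\} \\
  &= E\{\|\mathbf{h} - \hat{\mathbf{h}}\|^2\}
     + E\{\|\boldsymbol{\Delta}\|^2\}
     + 2\,\mathrm{Re}\, E\{\boldsymbol{\Delta}^H (\mathbf{h} - \hat{\mathbf{h}})\} \\
  &= E\{\|\mathbf{h} - \hat{\mathbf{h}}\|^2\}
     + \mathrm{tr}\, E\{(\hat{\mathbf{h}} - \mathbf{w})(\hat{\mathbf{h}} - \mathbf{w})^H\}
\end{aligned}
```

The cross term vanishes because both $\hat{\mathbf{h}}$ and $\mathbf{w}$ are linear in $\mathbf{z}$, and the LMMSE error $\mathbf{h} - \hat{\mathbf{h}}$ is orthogonal to $\mathbf{z}$. Expanding the outer product in the last line gives the four correlation terms inside the trace.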



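To simplify notation, the indices r and n are temporarily
dropped. As the number of iterations increases, the
matrix inversions in (35) and (42) can be simplified as
$\left( \tilde{\mathbf{S}}^H \tilde{\mathbf{S}} + \nu \mathbf{R}_h^{-1} \right)^{-1} = \mathrm{diag}\left[ \rho_1 / (\rho_1 N_d (E_s + \sigma_e^2) + \nu), \ldots, \rho_{N_t} / (\rho_{N_t} N_d (E_s + \sigma_e^2) + \nu) \right]$
and
$\left( \tilde{\mathbf{S}}^H \tilde{\mathbf{S}} + \mathcal{N}_0 \mathbf{R}_h^{-1} \right)^{-1} = \mathrm{diag}\left[ \rho_1 / (\rho_1 N_d (E_s + \sigma_e^2) + \mathcal{N}_0), \ldots, \rho_{N_t} / (\rho_{N_t} N_d (E_s + \sigma_e^2) + \mathcal{N}_0) \right]$,
where the subscript for $\rho$ for the time being indicates
the TX antenna. Also, noting $E\{\hat{\mathbf{h}} (\mathbf{h} - \hat{\mathbf{h}})^H\} = \mathbf{0}$ by the
orthogonality principle, it can be shown that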

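$$E\{\hat{\mathbf{h}} \hat{\mathbf{h}}^H\} = \mathbf{R}_h \mathbf{S}^H \tilde{\mathbf{S}} \left( \tilde{\mathbf{S}}^H \tilde{\mathbf{S}} + \nu \mathbf{R}_h^{-1} \right)^{-1} = \mathrm{diag}\left[ \frac{\rho_1^2 N_d E_s}{\rho_1 N_d (E_s + \sigma_e^2) + \nu}, \ldots, \frac{\rho_{N_t}^2 N_d E_s}{\rho_{N_t} N_d (E_s + \sigma_e^2) + \nu} \right]_{N_t \times N_t} \tag{44}$$

We also write

$$\begin{aligned}
E\{\mathbf{w} \mathbf{w}^H\} &= \left( \tilde{\mathbf{S}}^H \tilde{\mathbf{S}} + \mathcal{N}_0 \mathbf{R}_h^{-1} \right)^{-1} \tilde{\mathbf{S}}^H \left( \mathbf{S} \mathbf{R}_h \mathbf{S}^H + \mathcal{N}_0 \mathbf{I}_{N_d} \right) \tilde{\mathbf{S}} \left( \tilde{\mathbf{S}}^H \tilde{\mathbf{S}} + \mathcal{N}_0 \mathbf{R}_h^{-1} \right)^{-1} \\
&= \mathrm{diag}\left[ \frac{\rho_1^2 N_d \left( \rho_1 N_d E_s^2 + \rho_s \sigma_e^2 E_s + \mathcal{N}_0 (E_s + \sigma_e^2) \right)}{\left( \rho_1 N_d (E_s + \sigma_e^2) + \mathcal{N}_0 \right)^2}, \ldots, \frac{\rho_{N_t}^2 N_d \left( \rho_{N_t} N_d E_s^2 + \rho_s \sigma_e^2 E_s + \mathcal{N}_0 (E_s + \sigma_e^2) \right)}{\left( \rho_{N_t} N_d (E_s + \sigma_e^2) + \mathcal{N}_0 \right)^2} \right]_{N_t \times N_t}
\end{aligned} \tag{45}$$

where $\rho_s = \sum_{t=1}^{N_t} \rho_t$. Finally, substituting (44) and (45)
in (43) and also noting $E\{\mathbf{w} \hat{\mathbf{h}}^H\} = E\{\hat{\mathbf{h}} \mathbf{w}^H\}$, the MSE
convergence behavior of the estimator (42) can be shown to
be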



lim e 2 w [n] 

N d — too 



= £ optl n \ 



P^Es 



1 - 



(46) 



from which it is easy to see that the mismatched MSE is an
increasing function of the soft decision error variance $\sigma_e^2$.

To develop insights into the performance sensitivity of the
sequentially updated channel estimator against the variations
of the parameters $\sigma_e^2$ and $N_d$, we resort to an open-loop
investigation. For this, the decision-feedback channel estimator
is modified in such a way that unbiased soft decisions with
various $\sigma_e^2$ and $N_d$ combinations are artificially generated
for the channel estimator. A 7-iteration IDD receiver for the
2 × 2 16-QAM MIMO-OFDM system is used for this test,
but instead of using actual feedback from the demappers and
decoders, artificially generated soft decisions are provided to
the channel estimator of (22)-(25).

Fig. 8: EXIT charts for the 2 × 2 SM-MIMO-OFDM turbo receiver at SNR = 14 dB

Fig. 6 and Fig. 7 show the MSE performance depending on
the decision quality $\sigma_e^2$ and the number of feedback decisions
$N_d$, under the assumption of uncorrelated feedback decisions.
The signal power is fixed at $E_s = 1$ and the channel SNR
at 14 dB. With $N_d = 12$, the packet error rate (PER) due to
imperfect CSI became negligible when $\sigma_e^2 \approx 0.1$. In Fig. 7, it
is seen that reducing the number of feedback decisions, $N_d$,
while fixing the decision quality causes the MSE to increase.
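The open-loop idea can be imitated with a small Monte Carlo experiment. The sketch below is a simplified stand-in, not the sequential estimator of (22)-(25): it assumes a single-tap, single-antenna channel, QPSK symbols with $E_s = 1$, a one-shot batch LMMSE estimate, and artificially generated unbiased soft decisions with variance `sigma_e2`. All names and default values are illustrative assumptions.

```python
import random


def open_loop_mse(n_d, sigma_e2, snr_db=14.0, trials=2000, seed=1):
    """Monte Carlo MSE of a single-tap batch LMMSE estimate from soft decisions."""
    rng = random.Random(seed)
    e_s, rho = 1.0, 1.0                          # symbol energy, tap power
    noise_var = e_s / (10.0 ** (snr_db / 10.0))
    nu = sigma_e2 * rho + noise_var              # effective noise variance

    def cgauss(var):
        # Circularly symmetric complex Gaussian sample with variance `var`.
        std = (var / 2.0) ** 0.5
        return complex(rng.gauss(0.0, std), rng.gauss(0.0, std))

    qpsk = [complex(a, b) / 2 ** 0.5 for a in (1, -1) for b in (1, -1)]
    acc = 0.0
    for _ in range(trials):
        h = cgauss(rho)
        num, den = 0j, nu / rho
        for _ in range(n_d):
            s = rng.choice(qpsk)
            z = s * h + cgauss(noise_var)        # received sample
            s_soft = s + cgauss(sigma_e2)        # artificial soft decision
            num += s_soft.conjugate() * z
            den += abs(s_soft) ** 2
        acc += abs(h - num / den) ** 2
    return acc / trials
```

Comparing `open_loop_mse(6, 0.1)` with `open_loop_mse(24, 0.1)`, and `open_loop_mse(12, 0.01)` with `open_loop_mse(12, 0.3)`, reproduces the qualitative trends described above: fewer decisions or poorer decision quality both raise the MSE.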

V. Performance Evaluation 

The proposed algorithm is investigated through an extrinsic
information transfer (EXIT) chart analysis and packet error
rate (PER) simulation. Performance is evaluated for 2 × 2,
3 × 3 and 4 × 4 16-QAM SM-MIMO-OFDM systems. The
transmitter sends a packet with 1000 bytes of information.
The SISO MAP-demapper is used for the 2 × 2 SM-MIMO-OFDM
system, whereas the SISO MMSE-demapper is used
for the 3 × 3 and 4 × 4 SM-MIMO-OFDM systems [6] due
to complexity. A rate-1/2 convolutional code is used with
generator polynomials $g_0 = 133_8$ and $g_1 = 171_8$, complying
with the IEEE 802.11n specifications [26]. The SOVA is used
for decoding. The MIMO multi-path channel is modeled with
an exponentially-decaying power profile with $T_{\mathrm{rms}} = 50$ ns,
uncorrelated across the TX-RX links established.

A. EXIT and PER Performance Comparisons 

Fig. 9: PERs with different channel estimators: 2 × 2 SM-MIMO-OFDM (7 iterations)

The EXIT chart is a well-established tool that allows the
understanding of the average convergence behavior of the
mutual information (MI) in iterative soft-information processing
systems [28]. Fig. 8 shows the results of an EXIT chart
analysis on various competing schemes. A 2 × 2 SM-MIMO-OFDM
system is used for this, and an SNR of 14 dB is chosen.
$I_{A_1}$ and $I_{E_1}$ measure the MI at the input and output of the
demapper, respectively, whereas $I_{A_2}$ and $I_{E_2}$ are the respective
MI at the input and output of the decoder. At the next iteration
stage, $I_{E_1}$ becomes $I_{A_2}$ and $I_{E_2}$ turns to $I_{A_1}$.
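As an illustration of how an MI value on an EXIT trajectory can be measured from simulation data, the sketch below uses the common sample-average approximation $I \approx 1 - \frac{1}{N} \sum_n \log_2(1 + e^{-x_n L_n})$, where $x_n = \pm 1$ is the transmitted bit in antipodal form. This is an assumption about the measurement method, since the paper does not state which estimator it uses. The function name and the LLR sign convention (positive favours bit 0) are also assumptions.

```python
import math


def mutual_information(llrs, bits):
    """Sample-average MI estimate (bits per bit) between bits and their LLRs.

    Assumes LLR = log(P(bit = 0) / P(bit = 1)) and that the LLRs are
    consistent soft values; both are assumptions of this sketch.
    """
    if len(llrs) != len(bits) or not llrs:
        raise ValueError("llrs and bits must be non-empty and equally long")
    total = 0.0
    for llr, bit in zip(llrs, bits):
        signed = llr if bit == 0 else -llr   # positive when the LLR is correct
        # log2(1 + exp(-signed)), arranged so that exp never overflows
        if signed >= 0:
            total += math.log1p(math.exp(-signed)) / math.log(2.0)
        else:
            total += (-signed + math.log1p(math.exp(signed))) / math.log(2.0)
    return 1.0 - total / len(llrs)
```

All-zero LLRs give an estimate of 0 (no information), while large LLRs with the correct sign push the estimate toward 1, matching the two ends of the EXIT chart axes.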

In the figure, the top-most curve indicates the average transfer
function of MI through the demapper and the bottom-most
curve is the same function for the decoder. Both the demapper
and decoder EXIT chart curves correspond to Gaussian-distributed
input LLRs, and the demapper EXIT curve is also
based on the assumption of perfect channel estimation. The
stair-case MI plots represent actual MI measured during IDD
simulation runs and show how the MI improves through
the iterative process for three different channel estimation
schemes. The gap between each stair-case MI trajectory and
the demapper EXIT curve represents the performance loss
due to imperfect CSI. The solid stair-case line represents
the proposed channel estimation algorithm. The dashed line
(labeled "Song") corresponds to the Kalman channel estimator
of [23] applied to the conventional IDD setting (non-pipelined
IDD with a demapper utilizing the noise-variance update of
(27), with channel estimation using only the decoder output
decision). The dotted line is for the demapper utilizing only the
preamble-based initial channel estimation (following the IEEE
802.11n format, where a fixed number of initial preamble symbols
in the high-throughput long training field is utilized). For
the proposed scheme, the MI trajectory measurement is taken
from the last demapper in the pipeline, as the last demapper
block best reflects the quality of the final decisions. It is clear
that the proposed punctured-feedback Kalman estimation with
pipelined IDD shows superior MI convergence characteristics.
The scheme of [23] fails to improve MI beyond nine iterations.
With the demapper utilizing only initial channel estimation, the
trajectory stalls at an even earlier iteration.
Fig. 10: PERs with different channel estimators: 3 × 3 SM-MIMO-OFDM (7 iterations)

Fig. 11: PERs with different channel estimators: 4 × 4 SM-MIMO-OFDM (9 iterations)



Fig. 9 shows the PER performance of the receivers with
different channel estimators in the 2 × 2 SM-MIMO-OFDM
system. Seven iterations are applied, beyond which the iteration
gain plateaus. The performance gap between perfect CSI
and preamble-based initial CE only is nearly 3 dB at low
PERs. It can be seen that at low PER the proposed estimator
almost compensates for the loss due to imperfect CSI
when the threshold parameter is set at c = 2.5. Although
a small c gives inferior performance at
low SNRs, the proposed Kalman CE curve with c = 2.5
crosses the c = 4 curve as the SNR gets higher. A large c is
effective in averaging noise at low SNR, but allows relatively
large correlated errors. As expected from the EXIT chart
analysis results, the Kalman estimator of [23] that utilizes
only the decoder output in a non-pipelined setting does not
perform as well. As one of the algorithms considered for
comparison, the decision-directed EM estimator (referred to
as EM-DD here), introduced as a variant of the EM estimator
in [19], is applied with $\hat{\mathbf{h}}_n^{(r)} = \left( \tilde{\mathbf{S}}_n^H \tilde{\mathbf{S}}_n \right)^{-1} \tilde{\mathbf{S}}_n^H \mathbf{z}_n^{(r)}$.
In addition, the EM estimate is blended with the preamble-based
channel estimate by a combining method (i.e., $\hat{\mathbf{h}}_n = a_n \hat{\mathbf{h}}_{\mathrm{preamble}} + b_n \hat{\mathbf{h}}_{\mathrm{EM}}[n]$) [20].
A method to find the combining
coefficients $a_n$ and $b_n$ is discussed in [20]. The EM
noise variance update method presented in [19] computes $\mathcal{N}_0[n]$
as the squared residual between the observations and
$\tilde{\mathbf{S}}_n \hat{\mathbf{h}}_n^{(r)}$, summed over $r = 1, \ldots, N_r$ and $d = 0, \ldots, N_d - 1$
and scaled by $1/(N_r N_d)$. As can
be seen, this scheme also does not perform as well as the
proposed algorithm.

Fig. 10 and Fig. 11 show PER curves for the 3 × 3
and 4 × 4 SM-MIMO-OFDM systems, respectively.
These figures tell a consistent story. Namely, the initial-CE-only
scheme suffers about a 3 dB SNR loss relative to
the perfect CSI case. The proposed schemes close this gap
significantly, outperforming both the Kalman-based algorithm
of [23] and the EM-based algorithm of [19]. As for the
proposed channel estimation scheme, a more aggressive puncturing
(corresponding to a lower c value) tends to give a
lower PER as SNR increases.

Before finishing this section,
we briefly mention complexity. For all considered channel
estimation schemes (the proposed, the Song method and the
EM-DD scheme), implementation complexity largely arises
from the matrix inversion operation. All schemes require
matrix inversions of the same dimension. Consequently, the
proposed method and the Song method require complexity
that roughly grows as $2N_r \times O(N_t^3)$, whereas the EM-DD
requires complexity proportional to just $O(N_t^3)$. This is
because both our method and the Song method
require a matrix inversion for each receive antenna, whereas the
EM-DD method needs the matrix inversion just once, and the result can be
used for all receive antennas. The factor 2 accounts for the
fact that two matrix inversions are required for each update of
the Kalman gain in the proposed and Song methods.
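The relative inversion cost described above can be tabulated with a few lines of code. This is only a rough operation-count sketch, assuming a cubic cost for inverting an $N_t \times N_t$ matrix and ignoring all constant factors other than the $2N_r$ multiplier. The function name and the scheme labels are hypothetical.

```python
def inversion_cost(n_t, n_r, scheme):
    """Rough matrix-inversion cost, in units of one N_t^3 inversion scaling.

    Assumes a cubic inversion cost; scheme is one of "proposed", "song"
    or "em-dd" (labels chosen for this sketch).
    """
    cubic = n_t ** 3
    if scheme in ("proposed", "song"):
        # Two inversions per Kalman-gain update, repeated per receive antenna.
        return 2 * n_r * cubic
    if scheme == "em-dd":
        # One inversion shared by all receive antennas.
        return cubic
    raise ValueError("unknown scheme: " + scheme)
```

For a 4 × 4 system this gives a ratio of $2N_r = 8$ between the Kalman-based schemes and EM-DD, which is the gap the paragraph above refers to.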

VI. Conclusions 

A sequential soft-decision-directed channel estimation algo- 
rithm for MIMO-OFDM systems has been proposed for the 
specific pipelined turbo-receiver architecture. The algorithm 
deals with observation sample sets with varying levels of 
reliability. In coping with decision errors that propagate in the 
pipeline, we have introduced a novel method of innovating 
a correlated observation sequence via puncturing. Based on 
the refined innovation sequence, a Kalman-based estimator 
has been constructed. The proposed algorithm establishes im- 
proved Kalman-based channel estimation where the traditional 
innovations approach cannot create a true innovation sequence 
due to soft-decision error propagation. The EXIT chart, MSE 
analysis and PER simulation results have been used to validate 
the performance advantage of the proposed channel estimator. 